Identification of local multivariate outliers
نویسندگان
چکیده
Abstract The Mahalanobis distance between pairs of multivariate observations is used as a measure of similarity between the observations. The theoretical distribution is derived, and the result is used for judging on the degree of isolation of an observation. In case of spatially dependent data where spatial coordinates are available, different exploratory tools are introduced for studying the degree of isolation of an observation from a fraction of its neighbors, and thus to identify local multivariate outliers.
منابع مشابه
Local multivariate outliers as geochemical anomaly halos indicators, a case study: Hamich area, Southern Khorasan, Iran
Anomaly recognition has always been a prominent subject in preliminary geochemical explorations. Among the regional geochemical data processing, there are a range of statistical and data mining techniques as well as different mapping methods, which serve as presentations of the outputs. The outlier’s values are of interest in the investigations where data are gathered under controlled condition...
متن کاملIdentification of outliers types in multivariate time series using genetic algorithm
Multivariate time series data, often, modeled using vector autoregressive moving average (VARMA) model. But presence of outliers can violates the stationary assumption and may lead to wrong modeling, biased estimation of parameters and inaccurate prediction. Thus, detection of these points and how to deal properly with them, especially in relation to modeling and parameter estimation of VARMA m...
متن کاملSequential Application of Multivariate Outlier Test : a Robust Approach
Identification of outliers in multivariate data is not trivial. especially when there exists several outliers in the data. The classical identification method based on the sample mean and sample covariance matrix cannot always find them, because the classicd rnean and covariance matris are themselves affected by outliers. This problem is termed as masting7 because the outliers get maslied by ea...
متن کاملIdentification of Multivariate Outliers: A Performance Study
Three methods for the identification of multivariate outliers (Rousseeuw and Van Zomeren, 1990; Becker and Gather, 1999; Filzmoser et al., 2005) are compared. They are based on the Mahalanobis distance that will be made resistant against outliers and model deviations by robust estimation of location and covariance. The comparison is made by means of a simulation study. Not only the case of mult...
متن کاملNew Proposals in Multivariate Outliers Identification
Occurrences of outliers in multivariate time series are unpredictable events which may severely distort the analysis of the series. It may be noticed that a convenient way for representing multiple outliers consists in superimposing a deterministic disturbance to a Gaussian multivariate time series. Then outliers may be modelled as non – Gaussian time series components. The independent componen...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2013